Planning with an Adaptive World Model
Authors
Abstract
We present a new connectionist planning method [TML90]. By interaction with an unknown environment, a world model is progressively constructed using gradient descent. For deriving optimal actions with respect to future reinforcement, planning is applied in two steps: an experience network proposes a plan which is subsequently optimized by gradient descent with a chain of world models, so that an optimal reinforcement may be obtained when it is actually run. The appropriateness of this method is demonstrated by a robotics application and a pole balancing task.
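The two ingredients named in the abstract, a world model fitted by gradient descent and a plan refined by gradient descent through a chain of copies of that model, can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and not the paper's setup: the scalar linear model `x' = a*x + b*u` stands in for the world-model network, an all-zero initial plan stands in for the proposal of the experience network, the reinforcement is taken to be the negative squared distance of the final state from a goal, and all function names and constants are hypothetical.

```python
import random

# Hypothetical environment dynamics, unknown to the learner (assumption).
TRUE_A, TRUE_B = 0.9, 0.5

def collect_experience(n=200, seed=0):
    """Interact with the environment using random states and actions."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x, u = rng.uniform(-1, 1), rng.uniform(-1, 1)
        data.append((x, u, TRUE_A * x + TRUE_B * u))
    return data

def fit_world_model(data, lr=0.5, steps=500):
    """Fit x' ~ a*x + b*u by gradient descent on the mean squared prediction error."""
    a, b = 0.0, 0.0
    n = len(data)
    for _ in range(steps):
        grad_a = sum(2 * (a * x + b * u - y) * x for x, u, y in data) / n
        grad_b = sum(2 * (a * x + b * u - y) * u for x, u, y in data) / n
        a, b = a - lr * grad_a, b - lr * grad_b
    return a, b

def rollout(x0, plan, a, b):
    """Chain the world model once per action and return all visited states."""
    xs = [x0]
    for u in plan:
        xs.append(a * xs[-1] + b * u)
    return xs

def optimize_plan(x0, plan, goal, a, b, lr=0.1, steps=200):
    """Refine a proposed plan by gradient descent through the model chain.

    The cost is (x_T - goal)^2, i.e. the negative of the assumed reinforcement.
    """
    plan = list(plan)
    for _ in range(steps):
        xs = rollout(x0, plan, a, b)
        grad_x = 2.0 * (xs[-1] - goal)       # d cost / d x_T
        for t in reversed(range(len(plan))):
            plan[t] -= lr * grad_x * b       # d cost / d u_t
            grad_x *= a                      # propagate back to x_t
    return plan
```

A possible usage: call `fit_world_model(collect_experience())` to obtain `a` and `b`, then `optimize_plan(0.0, [0.0] * 5, 1.0, a, b)` to get a five-step plan whose predicted final state lies close to the goal `1.0`. In the method the abstract describes, the refined plan would then be run in the real environment; this sketch only evaluates it on the learned model.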
Similar resources
An economic-statistical model for production and maintenance planning under adaptive non-central chi-square control chart
Most of the inventory control models assume that quality defect never happens, which means production process is perfect. However, in real manufacturing processes, the production process starts its operation in the in-control state; but after a period of time, shifts to the out-of-control state because of occurrence of some disturbances. In this paper, in order to approach the model to real man...
Adaptive aggregate production planning with fuzzy goal programming approach
Aggregate production planning (APP) determines the optimal production plan for the medium term planning horizon. The purpose of the APP is effective utilization of existing capacities through facing the fluctuations in demand. Recently, fuzzy approaches have been applied for APP focusing on vague nature of cost parameters. Considering the importance of coping with customer demand in different p...
An integrated production and preventive maintenance planning model with imperfect maintenance in multi-state system
Production planning and maintenance are two important problems in manufacturing systems. Although the two problems are related through sudden failures and the production capacity occupied by maintenance activities, each is usually planned separately, and as a result the efficiency of the plans and models is reduced in the real world. The aim of integrated production and maintena...
Modelling the formation of Ozone in the air by using Adaptive Neuro-Fuzzy Inference System (ANFIS) (Case study: city of Yazd, Iran)
The impact of air pollution and environmental issues on public health is one of the main topics studied in many cities around the world. Ozone is a greenhouse gas that contributes to global climate. This study was conducted to predict and model ozone of Yazd in the lower atmosphere by an adaptive neuro-fuzzy inference system (ANFIS). All the data were extracted from 721 samples collected daily ove...
Adaptive Predictive Controllers Using a Growing and Pruning RBF Neural Network
An adaptive version of growing and pruning RBF neural network has been used to predict the system output and implement Linear Model-Based Predictive Controller (LMPC) and Non-linear Model-based Predictive Controller (NMPC) strategies. A radial-basis neural network with growing and pruning capabilities is introduced to carry out on-line model identification. An Unscented Kal...
Journal title:
Volume, issue:
Pages: -
Publication date: 1990